Detailed Radiation Fault Modeling of the Remote Exploration and Experimentation (REE) First Generation Testbed Architecture
Abstract
The goal of the NASA HPCC Remote Exploration and Experimentation (REE) Project is to transfer commercial supercomputing technology into space. The project will use state-of-the-art, low-power, non-radiation-hardened, Commercial Off-The-Shelf (COTS) hardware chips and COTS software to the maximum extent possible, and will rely on Software-Implemented Fault Tolerance (SIFT) to provide the required levels of availability and reliability. In this paper, we outline the methodology used to develop a detailed radiation fault model for the REE Testbed architecture. The model addresses the effects of energetic protons and heavy ions, which cause Single Event Upset (SEU) and Single Event Multiple Upset (SEMU) events in digital logic devices and are expected to be the primary fault generation mechanism. Unlike previous modeling efforts, this model will address fault rates and types in computer subsystems at a level of granularity fine enough (i.e., the register level) that specific software and operational errors can be derived. We present the current state of the model, model verification activities and results to date, and plans for the future. Finally, we explain the methodology by which this model will be used to derive application-level error effects sets. These error effects sets will be used in conjunction with our Testbed fault injection capabilities and our applications' mission scenarios to replicate the predicted fault environment on our suite of onboard applications.
Similar resources
REE: A COTS-Based Fault Tolerant Parallel Processing Supercomputer for Spacecraft Onboard Scientific Data Analysis
NASA's future spaceborne science missions will require supercomputing capabilities for both near-Earth and deep space exploration. Limited downlink bandwidth and excessive round-trip communication delays limit the capabilities and science value of missions which rely on terrestrial supercomputing resources. Projects such as the Gamma ray Large Area Space Telescope (GLAST), the Next Generation S...
Reliability and Availability Analysis for the JPL Remote Exploration and Experimentation System
The NASA Remote Exploration and Experimentation (REE) Project, managed by the Jet Propulsion Laboratory, has the vision of bringing commercial supercomputing technology into space, in a form which meets the demanding environmental requirements, to enable a new class of science investigation and discovery. Dependability goals of the REE system are 99% reliability over 5 years and 99% availabilit...
Demonstration of the Remote Exploration and Experimentation (REE) Fault-Tolerant Parallel-Processing Supercomputer for Spacecraft Onboard Scientific Data Processing
This paper is the written explanation for a demonstration of the REE Project's work to date. The demonstration is intended to simulate an REE system that might exist on a Mars Rover, consisting of multiple COTS processors, a COTS network, a COTS node-level operating system, REE middleware, and an REE application. The specific application performs texture processing of images. It was chosen as a...
Generalized Hypercube Structures and Hyperswitch Communication Network
One of the Grand Challenges of the Federal High Performance Computing and Communications (HPCC) Program is in remote exploration and experimentation (REE). The goal of the REE Project is to develop a space-borne computing technology base that will enable the next generation of missions to explore the Earth and the Solar System. This paper discusses an ongoing study that uses a recent developmen...
Publication date: 2000